Let $N_n = \{1, 2, \ldots, n\}$. Elements are drawn from the set $N_n$ with replacement, assuming that
each element has probability $1/n$ of being drawn. We determine the limiting distributions for
the waiting time until a given portion of pairs $jj$, $j \in N_n$, is sampled. Exact distributions of
some related random variables and their characteristics are also obtained.
1. Introduction

Combinatorial problems in the theory of probability and mathematical statistics have
been studied extensively. Many of them are formulated in the form of urn models. In
such problems, one usually considers a sequence of experiments with some stopping rule
defined a priori and the problem is to determine the exact and/or limit distribution of the
waiting time until the last experiment. Sums and extreme values of random variables and
rare events in a sequence of experiments appear naturally in connection with problems of
this kind. Consequently, many different approaches, methods and techniques have been
used to investigate combinatorial problems from the probabilistic point of view and a
number of limit theorems have been proven. The method of characteristic and moment
generating functions in summing random variables was used by Erdős and Rényi (1961),
Békéssy (1964), Baum and Billingsley (1965), Holst (1971), Samuel-Cahn (1974) and Flatto
(1982). The method of embedding in Poisson processes was used by Holst (1977, 1986).
For a general list of references concerning this subject, see, for example, Johnson and
Kotz (1977), Kolchin, Sevastyanov and Chistyakov (1976), Kolchin (1984) and Barbour,
Holst and Janson (1992).

In this paper, the following problem will be studied. We sample with replacement from
the set $N_n = \{1, 2, \ldots, n\}$, under the assumption that each element of $N_n$ has probability
$1/n$ of being drawn, and we are interested in the waiting time until a given portion of
pairs $jj$, $j \in N_n$, is sampled. In order to get limit distributions, we shall use the method
of characteristic functions and also an approach based on the extreme value theory for
stationary sequences; see Leadbetter, Lindgren and Rootzén (1983). The problem we
are going to consider is a generalization of the coupon collector's problem. Originally,
the waiting time for all $j$'s from $N_n$, supposing that all elements from $N_n$ have equal
probability of being drawn at each step, was named the coupon collector's problem. The
limiting distribution for this problem was first determined by Erdős and Rényi (1961).

This is an electronic reprint of the original article published by the ISI/BS in Bernoulli,
2008, Vol. 14, No. 2, 419-439. This reprint differs from the original in pagination and
typographic detail. 1350-7265 © 2008 ISI/BS. Author: Pavle Mladenovic.

A natural generalization of the coupon collector's problem is the problem of possible
limiting distributions for the waiting time for a given portion of $j$'s from $N_n$. This problem
was solved by Baum and Billingsley (1965). Another generalization is the waiting time
problem for a given number of appearances of all $j$'s from $N_n$. The limiting distribution
for this problem and some related results were obtained by different authors employing
different methods; see, for example, Holst (1986) and Mladenovic (1999, 2006).

The problem of waiting time until a given portion of pairs $jj$, $j \in N_n$, is sampled can
also be considered by using an approach based on point process theory. For a presentation
of this theory, see, for example, Chapter 3 of Resnick (1987). Let $A_n(k)$ be the number of
different pairs $jj$, $j \in N_n$, sampled up to the $k$th drawing and let $\{L_n(k), k \ge 1\}$ be the
point process determined by the indices where the process $A_n$ jumps. Point process theory
enables analysis of the asymptotic behavior of $\{L_n(k), k \ge 1\}$ and $\{A_n(L_n(k)), k \ge 1\}$.
This approach was used in Chapter 4 of Resnick (1987) to study the structure and
asymptotic behavior of records in a sequence of i.i.d. random variables with a continuous
distribution function $F$. However, the underlying distributions in the problem that will
be considered in this paper are discrete and depend on $n$.

The paper is organized as follows. Section 2 contains preliminaries, necessary notation 
and auxiliary results concerning exact distributions of random variables that appear in 
connection with the problem considered. Main results on asymptotic distributions are 
formulated in Section 3. Proofs of theorems from Sections 2 and 3 are given in Sections 
4 and 5. 

2. Preliminaries, notation and auxiliary results 

Let $Z_1, Z_2, Z_3, \ldots$ be a sequence of independent random variables with the uniform
distribution over the set $N_n = \{1, 2, \ldots, n\}$. Throughout this paper, we shall use the
following notation:

$$X_{nj} = \min\{k : Z_{k-1} = Z_k = j\}, \quad j \in N_n \text{ is a fixed number}, \tag{2.1}$$

$$Y_{nj} = \min\{k : Z_{k-1} = Z_k = a \text{ for some } a \in A \subset N_n,\ |A| = j\}, \tag{2.2}$$

$$M_n = \max\{X_{n1}, X_{n2}, \ldots, X_{nn}\}, \tag{2.3}$$

$$M_n^{(k)} = \text{the } k\text{th maximum of random variables } X_{n1}, \ldots, X_{nn}, \tag{2.4}$$

where $|A|$ is the number of elements of a set $A$. $X_{nj}$ is then the waiting time until the
pair $jj$ for some fixed $j \in N_n$ occurs as a run in the process $Z_1, Z_2, \ldots$, $Y_{nj}$ is the waiting
time until some pair $aa$, where $a \in A$ and $|A| = j$, occurs as a run in the same process
and $M_n$ is the waiting time until all $n$ pairs $jj$, $j \in N_n$, occur.

Let $\tilde{Y}_{nn}$ be the waiting time until the first pair $j_1 j_1$, where $j_1 \in N_n$, occurs as a run in
the process $Z_1, Z_2, \ldots$. Let $\tilde{Y}_{n,n-1}$ be the waiting time for the second pair $j_2 j_2$, where
$j_2 \in N_n \setminus \{j_1\}$, after the occurrence of the first pair, etc. Then $\tilde{Y}_{nj} \stackrel{d}{=} Y_{nj}$ for any $j \in N_n$,
where $X \stackrel{d}{=} Y$ means that random variables $X$ and $Y$ have the same distribution. Let us
denote by $S_{n,a_n}$ the waiting time until $a_n$ of the pairs $jj$, $j \in N_n$, occur, that is,

$$S_{n,a_n} = \tilde{Y}_{nn} + \tilde{Y}_{n,n-1} + \cdots + \tilde{Y}_{n,n-a_n+1}, \quad a_n \in N_n. \tag{2.5}$$

It is obvious that the following relations hold:

$$Y_{n1} \stackrel{d}{=} X_{n1} \stackrel{d}{=} X_{n2} \stackrel{d}{=} \cdots \stackrel{d}{=} X_{nn}; \tag{2.6}$$

$$S_{nn} = \max\{X_{n1}, \ldots, X_{nn}\} = M_n; \tag{2.7}$$

$$S_{n,n-k+1} = M_n^{(k)}, \quad k \text{ is a fixed positive integer}; \tag{2.8}$$

$$P\{X_{n1} > m, X_{n2} > m, \ldots, X_{nj} > m\} = P\{Y_{nj} > m\}. \tag{2.9}$$

It is also clear that random variables $\tilde{Y}_{nn}, \tilde{Y}_{n,n-1}, \ldots, \tilde{Y}_{n1}$ are independent, but random
variables $X_{n1}, X_{n2}, \ldots, X_{nn}$ are dependent. Let $F_n(x)$ be the common distribution function
of random variables $X_{n1}, \ldots, X_{nn}$. As usual, $\Phi(x)$ is the standard normal distribution
function. First, we shall give exact distributions and related characteristics of random
variables $X_{nj}$ and $Y_{nj}$ and results concerning the asymptotic behavior of mean and variance
of the random variable $S_{n,a_n}$.



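Theorem 2.1. (a) The distribution of the random variable $X_{nj}$ is given by

$$P\{X_{nj} = k\} = \sum_{s=0}^{[k/2]-1} \binom{k-s-2}{s} \left(\frac{1}{n}\right)^{s+2} \left(1 - \frac{1}{n}\right)^{k-s-2}, \quad k \ge 2. \tag{2.10}$$

(b) If $u_n = n^2(x + \ln n)$, then the following equality holds:

$$\lim_{n\to\infty} n(1 - F_n(u_n)) = \lim_{n\to\infty} nP\{X_{nj} > u_n\} = e^{-x}. \tag{2.11}$$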
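As a numerical sanity check of part (a), the following sketch compares the binomial-sum expression with a direct two-state recursion that only tracks whether the previous draw was $j$. The function names are illustrative, and the exact form of the sum used in `pmf_formula` is the one implied by (4.1)-(4.3); it should be read as an assumption about (2.10) rather than a quotation of it.

```python
from math import comb


def pmf_formula(n, k):
    """P{X_nj = k} for k >= 2 from the binomial sum implied by (4.1)-(4.3)."""
    p = 1.0 / n
    return sum(
        comb(k - s - 2, s) * p ** (s + 2) * (1 - p) ** (k - s - 2)
        for s in range(k // 2)          # s = 0, ..., [k/2] - 1
    )


def pmf_markov(n, kmax):
    """P{X_nj = k}, k = 2..kmax, by tracking whether the last draw equals j."""
    p = 1.0 / n
    no_j, one_j = 1.0, 0.0              # before any draw: "last symbol is not j"
    out = {}
    for k in range(1, kmax + 1):
        hit = one_j * p                 # previous draw was j and so is this one
        # not yet stopped: last draw is not j / last draw is a single j
        no_j, one_j = (no_j + one_j) * (1 - p), no_j * p
        if k >= 2:
            out[k] = hit
    return out
```

For moderate $k$ the two computations agree to rounding error, and summing $k\,P\{X_{nj}=k\}$ from the recursion reproduces the classical mean $n^2+n$ of the waiting time for a run of length two.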



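Theorem 2.2. (a) The distribution of random variable $Y_{nj}$ is given by

$$P\{Y_{nj} = k\} = \frac{j}{n\{(n+1)^2 - 4j\}^{1/2}} \left\{ \left(\frac{t_1}{n}\right)^{k-1} - \left(\frac{t_2}{n}\right)^{k-1} \right\}, \quad k \ge 2, \tag{2.12}$$

where

$$t_1 = t_1(j) = \frac{n - 1 + \{(n+1)^2 - 4j\}^{1/2}}{2}, \tag{2.13}$$

$$t_2 = t_2(j) = \frac{n - 1 - \{(n+1)^2 - 4j\}^{1/2}}{2}. \tag{2.14}$$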

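(b) Exact values of the mean and variance of the random variable $Y_{nj}$ are given by

$$EY_{nj} = \frac{n^2 + n}{j}, \qquad \operatorname{var} Y_{nj} = \frac{n^4 + 2n^3 + n^2 - 3n^2 j - nj}{j^2}. \tag{2.15}$$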

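Theorem 2.3. The asymptotic behavior of the mean $\mu_n$ and the variance $\sigma_n^2$ of the
random variable $S_{n,a_n}$ is determined as follows:

(a) if $a_n \to \infty$ and $a_n/n \to 0$ as $n \to \infty$, then

$$\mu_n = -n^2 \ln\left(1 - \frac{a_n}{n}\right) + o(n a_n^{1/2}), \qquad \sigma_n^2 \sim n^2 a_n \quad \text{as } n \to \infty; \tag{2.16}$$

(b) if $a_n/n \to \lambda \in (0, 1)$ as $n \to \infty$, and $\lambda_0 = \lambda/(1 - \lambda)$, then

$$\mu_n = -n^2 \ln\left(1 - \frac{a_n}{n}\right) + o(n^{3/2}), \qquad \sigma_n^2 \sim \lambda_0 n^3 \quad \text{as } n \to \infty; \tag{2.17}$$

(c) if $a_n/n \to 1$ and $b_n = n - a_n \to \infty$ as $n \to \infty$, then

$$\mu_n = -n^2 \ln\left(1 - \frac{a_n}{n}\right) + o(n^2 b_n^{-1/2}), \qquad \sigma_n^2 \sim n^4/b_n \quad \text{as } n \to \infty; \tag{2.18}$$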

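(d) as $n \to \infty$, the mean and variance of the maximum $M_n = S_{nn}$ are given by

$$EM_n = (n^2 + n)(\ln n + \gamma) + \frac{n+1}{2} - \frac{n+1}{12n} + O\left(\frac{1}{n^2}\right), \tag{2.19}$$

$$\operatorname{var} M_n = \frac{\pi^2 n^4}{6} + \left(\frac{\pi^2}{3} - 1\right) n^3 - 3n^2 \ln n + O(n^2), \tag{2.20}$$

where $\gamma = 0.5772156649\ldots$ is the Euler constant.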
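To see how close the leading terms in Theorem 2.3 are for finite $n$, one can sum the exact mean and variance of the independent summands of $S_{n,a_n}$ and compare with the main terms. The sketch below assumes $EY_{nj} = (n^2+n)/j$ and the variance $(n^4+2n^3+n^2-3n^2j-nj)/j^2$, which is what (4.29) yields; the single expression $n^3 a_n/(n-a_n)$ is a convenience that reduces to $n^2 a_n$, $\lambda_0 n^3$ and $n^4/b_n$ in regimes (a), (b) and (c), and is not a formula taken from the paper. Function names are illustrative.

```python
from math import log


def exact_mean_var(n, a):
    """Mean and variance of S_{n,a}: sums over independent Y_nj, j = n-a+1..n."""
    js = range(n - a + 1, n + 1)
    mean = sum((n * n + n) / j for j in js)
    var = sum(
        (n ** 4 + 2 * n ** 3 + n ** 2 - 3 * n * n * j - n * j) / j ** 2
        for j in js
    )
    return mean, var


def main_terms(n, a):
    """Leading terms: nu_n = -n^2 ln(1 - a/n) and tau_n^2 = n^3 a / (n - a)."""
    return -n * n * log(1 - a / n), n ** 3 * a / (n - a)
```

For example, `exact_mean_var(2000, 1000)` and `main_terms(2000, 1000)` differ by well under one percent in both components.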

3. Main results 

The next three theorems give the limiting distribution of the random variable $S_{n,a_n}$ for
different types of asymptotic behavior of the sequence $(a_n)$.

Theorem 3.1. If $a_n = k$ for every $n \in \mathbb{N}$, where $k$ is a fixed positive integer, then the
random variable $n^{-1}S_{n,a_n}$ converges in distribution to a random variable whose
characteristic function is

$$f(t) = (1 + t^2)^{-k/2} e^{ik \arctan t}. \tag{3.1}$$
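Reading (3.1) as $(1+t^2)^{-k/2}e^{ik\arctan t}$, the limit can be recognized as a Gamma$(k,1)$ law, that is, the law of a sum of $k$ independent standard exponential variables, because $1 - it = (1+t^2)^{1/2}e^{-i\arctan t}$. The short check below is only an illustration of that identity; the function names are not from the paper.

```python
import cmath
import math


def limit_cf(t, k):
    """Characteristic function read off from (3.1)."""
    return (1 + t * t) ** (-k / 2) * cmath.exp(1j * k * math.atan(t))


def gamma_cf(t, k):
    """Characteristic function of a Gamma(k, 1) random variable."""
    return (1 - 1j * t) ** (-k)
```

The two functions agree to rounding error for any real `t` and positive integer `k`.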
If $a_n/n \to \lambda \in [0, 1]$, $a_n \to \infty$ and $b_n = n - a_n \to \infty$ as $n \to \infty$, then $(S_{n,a_n} - \mu_n)/\sigma_n$
has asymptotically normal $(0,1)$ distribution, where $\sigma_n^2 = \operatorname{var} S_{n,a_n}$ and $\mu_n = ES_{n,a_n}$.
Denote by $\nu_n$ and $\tau_n^2$ the main terms of $\mu_n$ and $\sigma_n^2$, respectively, that are determined by
(2.16)-(2.18). Using the Khinchin lemma, we can conclude that the constants $\mu_n$ and $\sigma_n$
can be replaced in limit theorems by $\nu_n$ and $\tau_n$ because $\sigma_n/\tau_n \to 1$ and $(\mu_n - \nu_n)/\tau_n \to 0$
as $n \to \infty$. More detailed results are provided by the following theorem.

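Theorem 3.2. (a) If $a_n \to \infty$ and $a_n/n \to 0$ as $n \to \infty$, then

$$\lim_{n\to\infty} P\left\{ \frac{S_{n,a_n} + n^2 \ln(1 - a_n/n)}{n a_n^{1/2}} \le x \right\} = \Phi(x). \tag{3.2}$$

(b) If $a_n/n \to \lambda \in (0, 1)$ as $n \to \infty$ and $\lambda_0 = \lambda/(1 - \lambda)$, then

$$\lim_{n\to\infty} P\left\{ \frac{S_{n,a_n} + n^2 \ln(1 - a_n/n)}{\lambda_0^{1/2} n^{3/2}} \le x \right\} = \Phi(x). \tag{3.3}$$

(c) If $a_n/n \to 1$ and $b_n = n - a_n \to \infty$ as $n \to \infty$, then

$$\lim_{n\to\infty} P\left\{ \frac{S_{n,a_n} + n^2 \ln(1 - a_n/n)}{n^2 b_n^{-1/2}} \le x \right\} = \Phi(x). \tag{3.4}$$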



Theorem 3.3. If $n - a_n + 1 = k$ for a fixed positive integer $k$ and all positive integers
$n$, then the limiting distribution of the random variable $S_{n,a_n} = M_n^{(k)}$ is given by the
following equality:

$$\lim_{n\to\infty} P\{M_n^{(k)} \le n^2(x + \ln n)\} = e^{-e^{-x}} \sum_{s=0}^{k-1} \frac{e^{-sx}}{s!}. \tag{3.5}$$

In particular, the maximum $M_n = S_{nn}$ has, asymptotically, the Gumbel extreme value
distribution.
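A small Monte Carlo experiment gives a feel for Theorem 3.3 in the case $k = 1$. The sketch below simulates $M_n$ directly from its definition and provides the limit law, read here as $e^{-e^{-x}}\sum_{s<k} e^{-sx}/s!$, for comparison. The function names and the choice of $n$ in any experiment are illustrative assumptions; convergence in $n$ is slow (the relative error of the tail is of order $\ln n / n$), so only rough agreement should be expected for small $n$.

```python
import math
import random


def simulate_Mn(n, rng):
    """Draw uniformly from {1..n} until every pair jj has occurred as a run."""
    seen, prev, k = set(), None, 0
    while len(seen) < n:
        z = rng.randrange(1, n + 1)
        k += 1
        if z == prev:           # Z_{k-1} = Z_k = z: the pair zz has just occurred
            seen.add(z)
        prev = z
    return k


def gumbel_kth(x, k=1):
    """Limit law of the k-th maximum: exp(-e^{-x}) * sum_{s<k} e^{-s x} / s!."""
    return math.exp(-math.exp(-x)) * sum(
        math.exp(-s * x) / math.factorial(s) for s in range(k)
    )
```

A typical use is to draw a few hundred values of `simulate_Mn(n, random.Random(seed))`, compute the fraction that does not exceed $n^2(x + \ln n)$, and compare it with `gumbel_kth(x)`.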



4. Proofs of Theorems 2.1, 2.2 and 2.3 



Proof of Theorem 2.1. (a) It is easy to check that equality (2.10) holds for $k = 2$ and
$k = 3$. The event $\{X_{nj} = k\}$, where $k > 3$, means that no two adjacent of the random
variables $Z_1, Z_2, \ldots, Z_{k-3}$ take the value $j$ and that $Z_{k-2} \ne j$, $Z_{k-1} = Z_k = j$. Denote
by $A_s$ the event that exactly $s$ of the random variables $Z_1, Z_2, \ldots, Z_{k-3}$ take the value
$j$ and no two adjacent of them take the value $j$. Then

$$P(A_s) = \binom{k-s-2}{s} \left(\frac{1}{n}\right)^{s} \left(1 - \frac{1}{n}\right)^{k-s-3}, \quad s \in \{0, 1, \ldots, [k/2] - 1\}. \tag{4.1}$$

The following two equalities hold:

$$\{X_{nj} = k\} = \bigcup_{s=0}^{[k/2]-1} \{A_s,\ Z_{k-2} \ne j,\ Z_{k-1} = Z_k = j\}, \tag{4.2}$$

$$P\{Z_{k-2} \ne j,\ Z_{k-1} = Z_k = j\} = \left(1 - \frac{1}{n}\right) \frac{1}{n^2}. \tag{4.3}$$

If $k > 3$, then the equality (2.10) follows from (4.1), (4.2) and (4.3).
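(b) Using (2.10), we obtain that

$$1 - F_n(m) = \sum_{k=m+1}^{\infty} \sum_{s=0}^{[k/2]-1} \binom{k-s-2}{s} \left(\frac{1}{n}\right)^{s+2} \left(1 - \frac{1}{n}\right)^{k-s-2}. \tag{4.4}$$

From problem 7(d), page 76 of Riordan (1968), the following identity holds:

$$\sum_{s=0}^{[r/2]} \binom{r-s}{s} x^s = \frac{1}{a} \left[ \left(\frac{1+a}{2}\right)^{r+1} - \left(\frac{1-a}{2}\right)^{r+1} \right], \tag{4.5}$$

where $a = (1 + 4x)^{1/2}$. The sum on the left-hand side of identity (4.5) is related to the
Chebyshev polynomials. For $x = 1/(n - 1)$, we get

$$a = \left(1 + \frac{4}{n-1}\right)^{1/2} = \left(1 + \frac{3}{n}\right)^{1/2} \left(1 - \frac{1}{n}\right)^{-1/2} \tag{4.6}$$

and the tail $1 - F_n(m)$ can be represented in the form

$$1 - F_n(m) = \frac{1}{a\,n(n-1)} \left[ \frac{q_1^m}{1 - q_1} - \frac{q_2^m}{1 - q_2} \right], \tag{4.7}$$

where

$$q_1 = \frac{1}{2}\left(1 - \frac{1}{n}\right)(1 + a) = 1 - \frac{1}{n^2} + \frac{1}{n^3} + o\left(\frac{1}{n^3}\right), \tag{4.8}$$

$$q_2 = \frac{1}{2}\left(1 - \frac{1}{n}\right)(1 - a) = -\frac{1}{n} + O\left(\frac{1}{n^2}\right). \tag{4.9}$$

Let us determine $m$ from the condition $n(1 - F_n(m)) \to e^{-x}$ as $n \to \infty$. Using (4.7), (4.8)
and (4.9), this condition can be transformed in the following way:

$$\ln n - \frac{m}{n^2} + \frac{m}{n^3} + o\left(\frac{m}{n^3}\right) = -x + o(1) \quad \text{as } n \to \infty, \tag{4.10}$$

$$m = n^2(x + \ln n + o(1)) \quad \text{as } n \to \infty. \tag{4.11}$$

Consequently, (2.11) holds for $u_n = n^2(x + \ln n)$. □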
Proof of Theorem 2.2. (a) Let $A$ be a subset of $N_n$, $|A| = j$ and let

$$a_0 = 1, \quad b_0 = 0. \tag{4.12}$$

For any positive integer $l$, let us consider the set $S$ of all sequences of the form

$$c_1 c_2 \ldots c_l, \quad \text{where } c_1, c_2, \ldots, c_l \in N_n, \tag{4.13}$$

such that no sequence from $S$ contains two adjacent terms of the form $aa$, $a \in A$. Let $a_l$ be
the number of sequences from $S$ for which $c_l \in N_n \setminus A$, and $b_l$ the number of sequences
from $S$ such that $c_l \in A$. The following equalities then hold:

$$a_1 = n - j = (n - j)(a_0 + b_0); \tag{4.14}$$

$$b_1 = j = j a_0 + (j - 1) b_0; \tag{4.15}$$

$$a_{k-1} = (n - j)(a_{k-2} + b_{k-2}), \quad \text{for any } k \ge 2; \tag{4.16}$$

$$b_{k-1} = j a_{k-2} + (j - 1) b_{k-2}, \quad \text{for any } k \ge 2. \tag{4.17}$$

Let $s_l = a_l + b_l$ for $l \ge 0$. It follows from (4.12) and (4.14)-(4.17) that

$$s_0 = a_0 + b_0 = 1, \quad s_1 = a_1 + b_1 = n, \tag{4.18}$$

$$s_{k-1} = a_{k-1} + b_{k-1} = (n - 1)s_{k-2} + (n - j)s_{k-3} \quad \text{for } k \ge 3. \tag{4.19}$$

Hence, the sequence $(s_l)$ satisfies the linear difference equation (4.19) with initial
conditions (4.18). It follows that

$$s_{k-1} = C_1 t_1^{k-1} + C_2 t_2^{k-1} \quad \text{for any } k \ge 1, \tag{4.20}$$

where $t_1 = t_1(j)$ and $t_2 = t_2(j)$ are given by (2.13) and (2.14). Using initial conditions
(4.18), we obtain the constants $C_1$ and $C_2$:

$$C_1 = \frac{1}{2}\left\{1 + \left(1 - \frac{4j}{(n+1)^2}\right)^{-1/2}\right\}, \tag{4.21}$$

$$C_2 = \frac{1}{2}\left\{1 - \left(1 - \frac{4j}{(n+1)^2}\right)^{-1/2}\right\}. \tag{4.22}$$

Note that

$$P\{Y_{nj} = k\} = b_{k-1} n^{-k}, \quad k \in \{2, 3, \ldots\}, \tag{4.23}$$

$$b_{k-1} = s_{k-1} - a_{k-1} = s_{k-1} - (n - j)s_{k-2}. \tag{4.24}$$

Equalities (2.12) follow from (4.20)-(4.24).
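The counting argument above can be checked numerically. The following sketch iterates the recurrences for $a_l$ and $b_l$ with exact integers, converts $b_{k-1}$ into $P\{Y_{nj} = k\} = b_{k-1}n^{-k}$, and compares the result with the closed form $\frac{j}{nD}\{(t_1/n)^{k-1} - (t_2/n)^{k-1}\}$, where $D = \{(n+1)^2-4j\}^{1/2}$. That closed form is the reading of (2.12) obtained from (4.20)-(4.24) and is an assumption of this sketch; function names are illustrative.

```python
from math import sqrt


def pmf_recurrence(n, j, kmax):
    """P{Y_nj = k} = b_{k-1} / n**k, with a_l, b_l from (4.12) and (4.14)-(4.17)."""
    a, b = 1, 0                                   # a_0, b_0
    out = {}
    for k in range(2, kmax + 1):
        # right-hand side uses a_{k-2}, b_{k-2}; result is a_{k-1}, b_{k-1}
        a, b = (n - j) * (a + b), j * a + (j - 1) * b
        out[k] = b / n ** k
    return out


def pmf_closed(n, j, k):
    """Closed form obtained from the roots t_1, t_2 of t^2 - (n-1)t - (n-j) = 0."""
    D = sqrt((n + 1) ** 2 - 4 * j)
    t1, t2 = (n - 1 + D) / 2, (n - 1 - D) / 2
    return j / (n * D) * ((t1 / n) ** (k - 1) - (t2 / n) ** (k - 1))
```

Summing $k\,P\{Y_{nj} = k\}$ over the recurrence output also reproduces the mean $(n^2+n)/j$ used in the proof of part (b).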
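(b) Let $D = \{(n + 1)^2 - 4j\}^{1/2}$. We then have

$$EY_{nj} = \sum_{k=2}^{\infty} k P\{Y_{nj} = k\} = \frac{j}{nD} \sum_{k=2}^{\infty} k \left\{ \left(\frac{t_1}{n}\right)^{k-1} - \left(\frac{t_2}{n}\right)^{k-1} \right\}. \tag{4.25}$$

Note that $\sum_{k=2}^{\infty} k q^{k-1} = \frac{2q - q^2}{(1-q)^2}$, $|q| < 1$. We now get

$$EY_{nj} = \frac{j}{nD} \left\{ \frac{t_1(2n - t_1)}{(n - t_1)^2} - \frac{t_2(2n - t_2)}{(n - t_2)^2} \right\}. \tag{4.26}$$

Since $t_1 = (n - 1 + D)/2$, $t_2 = (n - 1 - D)/2$, $2n - t_1 = (3n + 1 - D)/2$, $2n - t_2 = (3n + 1 + D)/2$,
$n - t_1 = (n + 1 - D)/2$ and $n - t_2 = (n + 1 + D)/2$, the mean $EY_{nj}$ can be
represented in the form

$$EY_{nj} = \frac{j}{nD} \cdot \frac{U_1}{V_1}, \tag{4.27}$$

where $U_1$ and $V_1$ can be transformed in the following way:

$$\begin{aligned}
U_1 &= (n - 1 + D)(n + 1 + D)^2(3n + 1 - D) - (n - 1 - D)(n + 1 - D)^2(3n + 1 + D) \\
&= (n^2 + 2nD + D^2 - 1)(3n^2 + 4n + 2nD + 1 - D^2) \\
&\quad - (n^2 - 2nD + D^2 - 1)(3n^2 + 4n - 2nD + 1 - D^2) \\
&= 16nD(n^2 + n).
\end{aligned}$$

It also follows that $V_1 = (n + 1 - D)^2(n + 1 + D)^2 = 16j^2$. The first of the equalities (2.15)
follows easily from (4.27). Let us now determine $\operatorname{var} Y_{nj}$. We have

$$EY_{nj}^2 = \sum_{k=2}^{\infty} k^2 P\{Y_{nj} = k\} = \frac{j}{nD} \sum_{k=2}^{\infty} k^2 \left\{ \left(\frac{t_1}{n}\right)^{k-1} - \left(\frac{t_2}{n}\right)^{k-1} \right\}.$$

Since $\sum_{k=2}^{\infty} k^2 q^{k-1} = \frac{q^3 - 3q^2 + 4q}{(1-q)^3}$, $|q| < 1$, we obtain

$$\begin{aligned}
EY_{nj}^2 &= \frac{j}{nD} \left\{ \frac{t_1^3 - 3nt_1^2 + 4n^2 t_1}{(n - t_1)^3} - \frac{t_2^3 - 3nt_2^2 + 4n^2 t_2}{(n - t_2)^3} \right\} \\
&= \frac{j}{nD} \left\{ \frac{(n - 1 + D)^3 - 6n(n - 1 + D)^2 + 16n^2(n - 1 + D)}{(n + 1 - D)^3} \right. \\
&\qquad\quad \left. - \frac{(n - 1 - D)^3 - 6n(n - 1 - D)^2 + 16n^2(n - 1 - D)}{(n + 1 + D)^3} \right\} = \frac{j}{nD} \cdot \frac{U_2}{V_2},
\end{aligned} \tag{4.28}$$

where $V_2 = (n + 1 - D)^3(n + 1 + D)^3 = 64j^3$ and

$$\begin{aligned}
U_2 &= \{(n - 1 + D)^3 - 6n(n - 1 + D)^2 + 16n^2(n - 1 + D)\}(n + 1 + D)^3 \\
&\quad - \{(n - 1 - D)^3 - 6n(n - 1 - D)^2 + 16n^2(n - 1 - D)\}(n + 1 - D)^3 \\
&= (n^2 + 2nD + D^2 - 1)^3 - (n^2 - 2nD + D^2 - 1)^3 \\
&\quad - 6n\{(n + 1 + D)(n^2 + 2nD + D^2 - 1)^2 - (n + 1 - D)(n^2 - 2nD + D^2 - 1)^2\} \\
&\quad + 16n^2\{(n + 1 + D)^2(n^2 + 2nD + D^2 - 1) - (n + 1 - D)^2(n^2 - 2nD + D^2 - 1)\}.
\end{aligned}$$

Let $M = n^2 + D^2 - 1 = 2n^2 + 2n - 4j$. Since $D^2 = (n + 1)^2 - 4j$, we get

$$\begin{aligned}
U_2 &= 4nD\{(M + 2nD)^2 + (M + 2nD)(M - 2nD) + (M - 2nD)^2\} \\
&\quad - 6n\{(n + 1 + D)(M + 2nD)^2 - (n + 1 - D)(M - 2nD)^2\} \\
&\quad + 16n^2\{(n + 1 + D)^2(M + 2nD) - (n + 1 - D)^2(M - 2nD)\} \\
&= 4nD(3M^2 + 4n^2D^2) - 12nD\{4n(n + 1)M + M^2 + 4n^2D^2\} \\
&\quad + 64n^2D\{(n + 1)M + n(n + 1)^2 + nD^2\} \\
&= 64nD(2n^4 + 4n^3 + 2n^2 - 3n^2 j - nj)
\end{aligned}$$

and, consequently,

$$EY_{nj}^2 = \frac{j}{nD} \cdot \frac{U_2}{V_2} = \frac{1}{j^2}(2n^4 + 4n^3 + 2n^2 - 3n^2 j - nj). \tag{4.29}$$

The second of the equalities (2.15) follows from (4.29) and $EY_{nj} = (n^2 + n)/j$. □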
Proof of Theorem 2.3. For any positive integer $n$, let

$$H_n = \sum_{k=1}^{n} \frac{1}{k}, \qquad H_n^{(2)} = \sum_{k=1}^{n} \frac{1}{k^2}. \tag{4.30}$$

The equalities

$$H_n = \ln n + \gamma + \frac{1}{2n} - \frac{1}{12n^2} + \frac{\varepsilon_n}{120n^4}, \tag{4.31}$$

$$H_n^{(2)} = \frac{\pi^2}{6} - \frac{1}{n} + \frac{\theta_n}{n(n+1)} \tag{4.32}$$

hold, where $0 < \varepsilon_n < 1$ and $0 < \theta_n < 1$. The equality (4.31) can be found in Graham,
Knuth and Patashnik (1994), page 480, and (4.32) can easily be proven. Since

$$ES_{n,a_n} = (n^2 + n)(H_n - H_{n-a_n}), \tag{4.33}$$

relations concerning the asymptotic behavior of $ES_{n,a_n}$ follow from (4.31) and (4.33).
Similarly, using (4.31), (4.32) and the second of the equalities (2.15), we get relations
concerning the asymptotic behavior of $\operatorname{var} S_{n,a_n}$. □
5. Proofs of Theorems 3.1, 3.2 and 3.3

Let $D = D(j) = \{(n + 1)^2 - 4j\}^{1/2}$ and let $t_1 = t_1(j)$ and $t_2 = t_2(j)$ be given by (2.13)
and (2.14). In the sequel, we shall use the following series expansions and approximations
that follow from them:

$$D = n\left(1 + \frac{1}{n} - \frac{2j}{n^2} + \frac{2j}{n^3} - \frac{2j + 2j^2}{n^4} + \frac{2j + 6j^2}{n^5} + \cdots\right), \tag{5.1}$$

$$t_1 = n\left(1 - \frac{j}{n^2} + \frac{j}{n^3} - \frac{j + j^2}{n^4} + \cdots\right), \tag{5.3}$$

$$t_2 = -1 + \frac{j}{n} - \frac{j}{n^2} + \frac{j + j^2}{n^3} - \cdots, \tag{5.4}$$

$$n - t_1 = \frac{j}{n}\left(1 - \frac{1}{n} + \frac{1 + j}{n^2} - \cdots\right). \tag{5.5}$$

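Proof of Theorem 3.1. Let us determine the characteristic function of the random
variable $Y_{nj}$. Using (2.12), we obtain

$$\begin{aligned}
f_{Y_{nj}}(t) &= \sum_{k=2}^{\infty} e^{itk} P\{Y_{nj} = k\} = \frac{j}{nD} \left\{ \sum_{k=2}^{\infty} e^{itk} \left(\frac{t_1}{n}\right)^{k-1} - \sum_{k=2}^{\infty} e^{itk} \left(\frac{t_2}{n}\right)^{k-1} \right\} \\
&= \frac{j t_1}{n^2 D} \cdot \frac{e^{2it}}{1 - e^{it} t_1/n} - \frac{j t_2}{n^2 D} \cdot \frac{e^{2it}}{1 - e^{it} t_2/n}.
\end{aligned}$$

Consequently, we get that the characteristic function of the random variable $Y_{nj}$ can be
represented in the form

$$f_{Y_{nj}}(t) = \frac{j t_1}{n^2 D} \cdot \frac{e^{2it}}{1 - e^{it} t_1/n} \left\{ 1 - \frac{t_2}{t_1} \cdot \frac{1 - e^{it} t_1/n}{1 - e^{it} t_2/n} \right\}. \tag{5.6}$$

If $j/n \to 1$ as $n \to \infty$, then

$$\frac{j t_1}{nD} \to 1, \qquad \frac{t_2}{t_1} \cdot \frac{1 - e^{it/n} t_1/n}{1 - e^{it/n} t_2/n} \to 0, \tag{5.7}$$

and hence the asymptotic behavior of $f_{Y_{nj}/n}(t)$ is given by

$$f_{Y_{nj}/n}(t) \sim \frac{e^{2it/n}}{n - t_1 e^{it/n}}, \quad n \to \infty. \tag{5.8}$$

The relations

$$\begin{aligned}
|n - t_1 e^{it/n}|^2 &= \left(n - t_1 \cos\frac{t}{n}\right)^2 + t_1^2 \sin^2\frac{t}{n} = n^2 + t_1^2 - 2nt_1 \cos\frac{t}{n} \\
&= (n - t_1)^2 + 2nt_1\left(1 - \cos\frac{t}{n}\right) \to 1 + t^2 \quad \text{as } n \to \infty,\ j/n \to 1,
\end{aligned}$$

$$\begin{aligned}
\arg f_{Y_{nj}/n}(t) &\sim \arg \frac{e^{2it/n}}{n - t_1 e^{it/n}} = \frac{2t}{n} - \arctan \frac{-t_1 \sin(t/n)}{n - t_1 \cos(t/n)} \\
&= \frac{2t}{n} - \arctan \frac{-t + o(1)}{1 + o(1)} \to \arctan t \quad \text{as } n \to \infty
\end{aligned}$$

hold, where we again used the assumption that $j/n \to 1$ as $n \to \infty$. Hence, $f_{Y_{nj}/n}(t) \to
(1 + t^2)^{-1/2} e^{i \arctan t}$ as $n \to \infty$. Consequently, if $a_n = k$, where $k$ is a fixed positive integer,
we get that $(\tilde{Y}_{nn} + \cdots + \tilde{Y}_{n,n-k+1})/n$ converges in distribution to a random variable whose
characteristic function is given by (3.1). □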

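Proof of Theorem 3.2. Using (5.6), we obtain that the characteristic function of the
sum $S_{n,a_n}$ can be represented in the following way:

$$f_{S_{n,a_n}}(t) = \prod_{j=n-a_n+1}^{n} f_{nj}(t) = P_{n1} \cdot P_{n2} \cdot P_{n3}(t) \cdot P_{n4}(t), \tag{5.9}$$

where $f_{nj}$ is the characteristic function of $Y_{nj}$ and $P_{n1}$, $P_{n2}$, $P_{n3}(t)$ and $P_{n4}(t)$ are given by

$$P_{n1} = \prod_{j=n-a_n+1}^{n} \frac{t_1(j)}{n}, \tag{5.10}$$

$$P_{n2} = \prod_{j=n-a_n+1}^{n} \left(1 + \frac{2}{n} + \frac{1 - 4j}{n^2}\right)^{-1/2}, \tag{5.11}$$

$$P_{n3}(t) = \prod_{j=n-a_n+1}^{n} \left\{ 1 - \frac{t_2(j)}{t_1(j)} \cdot \frac{1 - e^{it} t_1(j)/n}{1 - e^{it} t_2(j)/n} \right\}, \tag{5.12}$$

$$P_{n4}(t) = \prod_{j=n-a_n+1}^{n} \frac{j}{n^2} \cdot \frac{e^{2it}}{1 - e^{it} t_1(j)/n}. \tag{5.13}$$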
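Lemma 5.1. If $a_n/n \to \lambda \in [0, 1]$ as $n \to \infty$, then the following relation holds:

$$\lim_{n\to\infty} P_{n1} = e^{-\lambda + \lambda^2/2}. \tag{5.14}$$

Proof. The relations

$$-\frac{j}{n^2} - \frac{c_1}{n^2} \le \frac{t_1(j)}{n} - 1 \le -\frac{j}{n^2} + \frac{c_2}{n^2}, \qquad \sum_{j=n-a_n+1}^{n} j = \frac{(2n - a_n + 1)a_n}{2}, \tag{5.15}$$

$$\ln(1 + x) = x + r_1(x), \quad |r_1(x)| \le |x|^2, \quad \text{for } |x| \le 1/2, \tag{5.16}$$

hold, where constants $c_1$ and $c_2$ do not depend on $j$. Consequently, we get

$$\ln P_{n1} = \sum_{j=n-a_n+1}^{n} \ln \frac{t_1(j)}{n} = -\frac{1}{n^2} \cdot \frac{(2n - a_n + 1)a_n}{2} + O\left(\frac{a_n}{n^2}\right) \tag{5.17}$$

and the statement of the lemma follows easily. □

Lemma 5.2. If $a_n/n \to \lambda \in [0, 1]$ as $n \to \infty$, then the following relation holds:

$$\lim_{n\to\infty} P_{n2} = e^{\lambda - \lambda^2}. \tag{5.18}$$

Proof. The statement of the lemma follows from the following relations:

$$\ln P_{n2} = -\frac{1}{2} \sum_{j=n-a_n+1}^{n} \ln\left(1 + \frac{2}{n} + \frac{1 - 4j}{n^2}\right) = -\frac{1}{2} \sum_{j=n-a_n+1}^{n} \frac{2n - 4j}{n^2} + o(1) = -\frac{a_n}{n} + \frac{(2n - a_n + 1)a_n}{n^2} + o(1).$$

□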



Lemma 5.3. If $\tau_n^2$ is the main term of $\sigma_n^2$ determined by (2.16)-(2.18), $\tau_n > 0$ and
$a_n \to \infty$ as $n \to \infty$, then for any real $t$ the following equality holds:

$$\lim_{n\to\infty} P_{n3}\left(\frac{t}{\tau_n}\right) = 1. \tag{5.19}$$

Proof. Using (2.13) and (2.14), it is easy to prove that the following inequalities hold 
for any jg{l,2,...,n}: 

_1<M4<0<1-^<1. (5.20) 
n ti[j) n n 

For sufficiently large n, the inequality 1 ~ < cos 7- < 1 also holds and for such values 
of n, we obtain 
|l_eit/-. .(t2/n)|2 l-{2t2/n)cos{t/T„) + tl/



2/„2 



< 



l + {2ti/n){l/n-l)+tyn'^ _ {I - ti/nf + 2ti/n^ 



1 - 2t2/n + tl/n'^ 



{i-t2/ny 



(5.21) 



< 



2 3 

n'^ n n 



Using the first of the inequalities (5.19) and the inequality (5.20), we obtain from (5.12) 
that argP„3(i/T„) and Pn3(t/T„) — > 1 as ti — > oo for every real t. □ 

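Lemma 5.4. Let $\tau_n^2$ be the main term of $\sigma_n^2$, $\tau_n > 0$ and suppose that $a_n \to \infty$ and
$b_n = n - a_n \to \infty$ as $n \to \infty$. For any real $t$, the following asymptotic relations then hold
as $n \to \infty$:

$$\ln\left|P_{n4}\left(\frac{t}{\tau_n}\right)\right| = \frac{a_n^2}{2n^2} - \frac{t^2 n^4}{2\tau_n^2}\left(H_n^{(2)} - H_{n-a_n}^{(2)}\right) + o(1), \tag{5.22}$$

$$\left|P_{n4}\left(\frac{t}{\tau_n}\right)\right| \to \exp\left\{\frac{\lambda^2}{2} - \frac{t^2}{2}\right\}, \quad a_n/n \to \lambda \in [0, 1]. \tag{5.23}$$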



Proof, (a) We shall use the following inequalities: 
1 



1 h , 
-<-<l, 

n n 



^ t t" 

rr J < 1 — cos < ^ . 

2t2 24Ti - r„ - 2t2 



z ci n 
- + ^ < — 



1 - 



t^ 



< 1 - 



2.? , C2 



(5.22) 
(5.23) 

(5.24) 
(5.25) 

(5.26) 



where the constants ci and C2 do not depend on j. Inequalities (5.24) and (5.25) are 
straightforward exercises and (5.26) follows from the equality 



3' 



1 - 



2j2„2 



2{n + ly - 4j - 2n{n +1)1 + - + 



1 - 4? 



;\ 1/2 



Using the equality 



1 - ^c'*/^ 



= ( 1 - - I + ^ ( 1 - cos — 



(5.27) 



(5.28) 



and inequalities (5.24)-(5.26), we get 
It follows from (5.13) that



Pni ( — 
Tn 



1 - ^c"/-" 



< 1 
(5.30)



n ^ 



J 



1 



Finally, using (5.16) and (5.29)-(5.31), we get, as n— >oo, 



-1/2 



(5.31) 



In 



P„4 ( — 



y In + 4 



J— n— a„+l 



1 " 



2 2j n^f^ 



j=n-a„ + l ^ "-^ 



^_!i^fij(2)_ij(2) A 

2n2 2t2 ^ " 



(b) Using (4.32) and the main term of variance cr^ from relations (2.16)-(2.18); we 



get relation (5.23) 



□ 



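Lemma 5.5. If $\nu_n$ and $\tau_n^2$ are the main terms of the mean $\mu_n = ES_{n,a_n}$ and the variance
$\sigma_n^2 = \operatorname{var} S_{n,a_n}$, $\tau_n > 0$ and $S^*_{n,a_n} = \tau_n^{-1}(S_{n,a_n} - \nu_n)$, then

$$\arg f_{S^*_{n,a_n}}(t) = o(1) \quad \text{as } n \to \infty. \tag{5.32}$$

Proof. The following equalities hold:

$$\arg P_{n4}\left(\frac{t}{\tau_n}\right) = \frac{2ta_n}{\tau_n} - \sum_{j=n-a_n+1}^{n} \arg\left(1 - \frac{t_1(j)}{n} e^{it/\tau_n}\right) = \frac{2ta_n}{\tau_n} + \sum_{j=n-a_n+1}^{n} \arctan \frac{N_n(t,j)}{D_n(t,j)}, \tag{5.33}$$

where

$$N_n(t,j) = \frac{t_1(j)}{n} \sin\frac{t}{\tau_n}, \qquad D_n(t,j) = 1 - \frac{t_1(j)}{n} \cos\frac{t}{\tau_n}. \tag{5.34}$$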



Case 1. Let $a_n \to \infty$, $a_n/n \to 0$ as $n \to \infty$. We then have $\tau_n^2 = n^2 a_n$, $\tau_n = n a_n^{1/2}$ and
$\nu_n = -n^2 \ln(1 - a_n/n)$. For $N_n(t,j)$ and $D_n^{-1}(t,j)$, we obtain



.7 



(5.35) 



Limit distributions for the problem of collecting pairs 433 

(5.36) 

(5.37) 



J 

Nn{t,]) tn 



_ J__ 


t^ 




n^ 


2jan 






t^ 


+ — 


n n^ 


2ja„ 





Dn{t,3) jai/^ 

In (5.35)-(5.37) and in relations that will follow, $C = C(t) > 0$ is a constant which does not
depend on $j$, and $\vartheta \in [-1, 1]$. Also, note that $\vartheta$ may be different at different occurrences.
Consequently, we obtain the following results:

„ / t\ 2ta„ ^ tn ( 1 2j t^ dC 

argP„4 - V -TTT l + 



2ta„ 

1/2 ^ 
nOn' j 


n 

E 

— n— a^i 


2ta}/^ 


tn / 


n 


1/2 \ 


2tal/^ 


t^ 



1/^ [ n n^ 2jan 



1 + -]{H,,-H,,^aj 



n 2arl'^ 



tn f l\i o.n\ t^n On ( a,,^ ^ 



y n J \ n J 2a^^^ n'^ \ n 



1 + r In 1-^ --^.^1-^ +0(1) 



i-n^' ^72 1-- 

tal/^ + o{l) asn^oo; 



n -^"j ( —Tj2 ) = ^«'/' + 0(1), 

j=n-a„+i \na. 



1/2 ; 11 ■'"■J I 1/2 

nan / j=„_a^^+i \na„ 
arg/s,-„ (t) = -^^ + toy2 + 0(1)^0(1), „ ^ oo. (5.38) 

Case 2. Let $a_n/n \to \lambda \in (0, 1)$ as $n \to \infty$, and $\lambda_0 = \lambda/(1 - \lambda)$. We then have $\tau_n^2 = \lambda_0 n^3$,
$\tau_n = \lambda_0^{1/2} n^{3/2}$ and $\nu_n = -n^2 \ln(1 - a_n/n)$. We now get

t ( j 99(7 1 
J 1^ n n- 2Xojn n^ J 
(5.41)



and, consequently, we obtain the following: 



'1 \- \ <^ n t- I- _ 



2to„ tn^/^ / 1 



_ 2ta^^ J ) + o(l) 

-^(^l + -j(i/„-ff„_„J+o(l) 
1 + - 



\l/2 V „, 



X nnn - ln(n - a„) + - ^7 r + —>+ o(l) 

^ Infl-— )+o(l) asn^oo; (5.42) 



j=n-a„ + l \Aq n ' / Aq \ / 



Xq' n-y^ Aq' 



Case 3. Let $a_n/n \to 1$ as $n \to \infty$ and $b_n = n - a_n > 0$ for all $n$. We then have $\tau_n^2 = n^4/b_n$,
$\tau_n = n^2 b_n^{-1/2}$ and $\nu_n = -n^2 \ln(1 - a_n/n) = n^2 \ln(n/b_n)$. We now get



n ' 



N„it,j)^^\l-^ + —'>, (5.43) 
D-\t,j)^—{l + --A^-—^ + —l (5.44) 
N,,{t,j) _tbl/' f ^ 1 2j t\, ^ dC\ ^^^g^
Dn{t,j) j \ n 2jn^ J 

and, consequently, we obtain the following: 

t \ 2tanbl/^ 'A tbl/'^ 1 2j tHn -dC 



argP„4 — = 2 — + 2^ V + 2" 



7 1 n n 2jn n 

j=n-a„+l ^ ■' 



1/2 



+ tblp(l + -){Hn-Hn-aJ 



^, ,1/2 +3l3/2 

.^J^^i^^H^p^H^^lj^ oil) 
tbl/^ + ^) (-^^ - ) + 

:t6y^(l + i)(w-ln6„ + i--^ + ...)+o(l) 
i&y^ln-^ + o(l) asn^cx); (5.46) 

bn 



arg n ( "^^"^^^'^6"'^''^"'^^' '^^oo; 



arg/s* (i) = + ^^V' In ^ + 0(1) = o(l), ti ^ ex.. 

Finally, using relations (5.9)-(5.14), (5.18), (5.19), (5.23) and (5.32), we obtain that
in all cases considered, $f_{S^*_{n,a_n}}(t) \to e^{-t^2/2}$ as $n \to \infty$ and therefore the proof of Theorem
3.2 is completed. □



Proof of Theorem 3.3. We shall first prove a few lemmas. 



Lemma 5.6. Let $X^*_{nj}$, $j \in \{1, 2, \ldots, n\}$, be independent random variables with the
probability distribution

$$P\{X^*_{nj} \le x\} = F_n(x), \tag{5.47}$$

and let $M^*_n = \max\{X^*_{n1}, \ldots, X^*_{nn}\}$. For every real $x$, the following equality then holds:

$$\lim_{n\to\infty} P\{M^*_n \le n^2(x + \ln n)\} = e^{-e^{-x}}. \tag{5.48}$$

Proof. The statements $P\{M^*_n \le u_n\} \to e^{-\tau}$ as $n \to \infty$ and $n(1 - F_n(u_n)) \to \tau$ as $n \to \infty$
are equivalent. Hence, (5.48) follows from (2.11). □
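The convergence $n(1 - F_n(u_n)) \to e^{-x}$ behind Lemma 5.6 can be observed numerically. The sketch below evaluates the tail $P\{Y_{nj} > m\}$ by summing the two geometric series in the distribution of $Y_{nj}$ (for $j = 1$ this is $1 - F_n(m)$, by (2.6)). The closed form used is an assumption derived from the reading of (2.12) as $\frac{j}{nD}\{(t_1/n)^{k-1} - (t_2/n)^{k-1}\}$; the function name is illustrative. The quantity $n - t_1$ is written as $2j/(n+1+D)$ to avoid cancellation for large $n$.

```python
import math


def tail_Y(n, j, m):
    """P{Y_nj > m} for an integer m >= 1, from the geometric tails of the pmf."""
    D = math.sqrt((n + 1) ** 2 - 4 * j)
    t2 = (n - 1 - D) / 2
    n_minus_t1 = 2 * j / (n + 1 + D)        # equals n - t1 = (n + 1 - D) / 2
    # (t1/n)^m computed as exp(m * log(1 - (n - t1)/n)) for numerical stability
    a1 = j / (n_minus_t1 * D) * math.exp(m * math.log1p(-n_minus_t1 / n))
    a2 = j / ((n - t2) * D) * (t2 / n) ** m
    return a1 - a2
```

With $m = [n^2(x + \ln n)]$, the product `n * tail_Y(n, 1, m)` approaches $e^{-x}$ with a relative error of order $(x + \ln n)/n$, in line with the correction term that appears in Lemma 5.7.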

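Lemma 5.7. If $j$ is a fixed positive integer and $n \to \infty$, then the following asymptotic
relation holds:

$$P\{Y_{nj} > n^2(x + \ln n)\} = \frac{e^{-jx}}{n^j} \left\{ 1 + \frac{j(x + \ln n)}{n} + O\left(\left(\frac{\ln n}{n}\right)^2\right) \right\}. \tag{5.49}$$

Proof. Let $u_n = n^2(x + \ln n)$ and $r_n = u_n - [u_n]$. Using (2.12), we obtain that

$$P\{Y_{nj} > u_n\} = \frac{j}{(n - t_1)\{(n+1)^2 - 4j\}^{1/2}} \left(\frac{t_1}{n}\right)^{u_n - r_n} - \frac{j}{(n - t_2)\{(n+1)^2 - 4j\}^{1/2}} \left(\frac{t_2}{n}\right)^{u_n - r_n} = A_1 - A_2.$$

Using (5.1), (5.3) and (5.5), we obtain that

$$\begin{aligned}
\ln A_1 &= \ln j - \ln(n - t_1) - \ln\{(n + 1)^2 - 4j\}^{1/2} + (u_n - r_n) \ln(t_1/n) \\
&= \ln j - \ln j + \ln n - \ln\left(1 - \frac{1}{n} + o\left(\frac{1}{n}\right)\right) - \ln n - \ln\left(1 + \frac{1}{n} + o\left(\frac{1}{n}\right)\right) \\
&\quad + \{n^2(x + \ln n) - r_n\} \ln\left(1 - \frac{j}{n^2} + \frac{j}{n^3} + o\left(\frac{1}{n^3}\right)\right) \\
&= -j(x + \ln n) + \frac{j(x + \ln n)}{n} + O\left(\frac{\ln n}{n^2}\right)
\end{aligned}$$

and, consequently,

$$A_1 = \frac{e^{-jx}}{n^j} \left\{ 1 + \frac{j(x + \ln n)}{n} + O\left(\left(\frac{\ln n}{n}\right)^2\right) \right\}. \tag{5.50}$$

Since $t_2/n \sim -1/n$ as $n \to \infty$, $A_2$ is negligible in $P\{Y_{nj} > u_n\} = A_1 - A_2$ and hence the
equality (5.49) follows from (5.50). □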
Lemma 5.8. Let $P_{nj} = P\{Y_{nj} > n^2(x + \ln n)\}$. For any positive integer $j \ge 2$, the
following relation holds as $n \to \infty$:

$$P_{nj} - P_{n,j-1} P_{n1} = \frac{1}{n^{j+1}} \cdot O(1). \tag{5.51}$$

Proof. Relation (5.51) follows from (5.49), because the terms of order $(\ln n)/n$ in $P_{nj}$ and in the product $P_{n,j-1}P_{n1}$ cancel. □

Lemma 5.9. Let $x$ be a real number and $k$ and $l$ positive integers, such that $k + l \le n$.
There then exists a constant $C(x)$ such that for $u_n = n^2(x + \ln n)$, the inequality

(k+l \ / \ / k+l N 

f]{Xn^<Un}\-Pif]{Xn^<Un}\pi fl {Xn^<Un} 
1=1 ) \i=\ ) \t=fe+l / 

<C(a;)niin{fc,0^<^^ 

holds, that is, the condition D{un) is satisfied, where this condition is defined in Chapter 
3, Section 3.2 of Leadbetter et al. (1983). 

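Proof. Let

$$\Delta_n(k, 1) = P\left(\bigcap_{j=1}^{k+1} \{X_{nj} \le u_n\}\right) - P\left(\bigcap_{j=1}^{k} \{X_{nj} \le u_n\}\right) \cdot P\{X_{n,k+1} \le u_n\},$$

$$D_j = \{X_{nj} > u_n\}, \quad j = 1, 2, \ldots, k \quad \text{and} \quad A = \{X_{n,k+1} > u_n\}.$$

We then have

$$\begin{aligned}
\Delta_n(k, 1) &= P(D_1^c \cdots D_k^c A^c) - P(D_1^c \cdots D_k^c) P(A^c) \\
&= 1 - P(D_1 \cup \cdots \cup D_k \cup A) - (1 - P(D_1 \cup \cdots \cup D_k))(1 - P(A)) \\
&= P((D_1 \cup \cdots \cup D_k) \cap A) - P(D_1 \cup \cdots \cup D_k) P(A) \\
&= P(D_1 A \cup \cdots \cup D_k A) - P(D_1 \cup \cdots \cup D_k) P(A).
\end{aligned}$$

Using the inclusion-exclusion principle, we obtain that

$$\Delta_n(k, 1) = \sum_{m=2}^{k+1} (-1)^m \binom{k}{m-1} (P_{nm} - P_{n,m-1} P_{n1}). \tag{5.52}$$

Using (5.51) and (5.52), we get the statement of the lemma, first for integers $k$ and $l = 1$,
then for arbitrary $k$ and $l$, where $k + l \le n$. □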
Lemma 5.10. For $u_n = n^2(x + \ln n)$, the condition $D'(u_n)$ is satisfied (this
condition is defined in Chapter 3, Section 3.4 of Leadbetter et al. (1983)):

$$\lim_{k\to\infty} \limsup_{n\to\infty} n \sum_{j=2}^{[n/k]} P\{X_{n1} > u_n, X_{nj} > u_n\} = 0. \tag{5.53}$$

Proof. It follows from (2.9) that $P\{X_{n1} > u_n, X_{nj} > u_n\} = P\{Y_{n2} > u_n\}$ holds for every
$j \ge 2$. Hence, as $n \to \infty$, we get

$$n \sum_{j=2}^{[n/k]} P\{X_{n1} > u_n, X_{nj} > u_n\} = n\left(\left[\frac{n}{k}\right] - 1\right) P\{Y_{n2} > u_n\} \le \frac{n^2}{k} \cdot \frac{e^{-2x}}{n^2}(1 + o(1)) = \frac{e^{-2x}}{k}(1 + o(1))$$

and, consequently, (5.53) holds.

Now, Theorem 3.3 follows from Lemma 5.6, Lemma 5.9, Lemma 5.10 and Theorem
5.3.1 from Leadbetter, Lindgren and Rootzén (1983). □